Abstract. This article is an introduction for the nonpractitioner to the ideas
and issues of LISA data analysis, as reflected in the explorations and experiments 
of the participants in the Mock LISA Data Challenges. In particular, I discuss 
the methods and codes that have been developed for the detection and parameter 
estimation of supermassive black-hole binaries, extreme mass-ratio inspirals, and 
Galactic binaries.

1. Introduction

The Laser Interferometer Space Antenna (LISA) is a joint NASA-ESA mission to 
measure gravitational waves (GWs) in the low-frequency band between a few $10^{-5}$
and $10^{-2}$ Hz. LISA consists of a triangular constellation of spacecraft, flying in a
solar orbit, and exchanging laser beams to monitor their relative distances ($\sim 5 \times 10^6$
km) to picometer precision [1]. The LISA dataset will contain GWs from millions
of sources, most of them in our Galaxy, but many in the distant Universe, as far as 
z ~ 20. Some sources, such as Galactic binaries, have simple waveforms; some, such 
as the inspirals of supermassive black holes (SMBHs), will be so strong that they 
will stand out clearly in the dataset. Other sources, such as the extreme-mass-ratio 
inspirals (EMRIs) of compact objects into central galactic BHs, generate exceedingly 
rich, complicated, but relatively weak waves that we will need to dig out from the 
LISA noise by matching their phases accurately across $\sim 10^5$ cycles. The millions of
compact binaries in our Galaxy will merge in a confusion-limited foreground that we 
will be able to characterize statistically, and from which we will individually resolve a 
fraction of louder or higher-frequency sources. 

It is clear from this picture that effective data-analysis techniques will be crucial 
to the success of this mission, which hopes to deliver an impressive list of scientific 
payoffs, ranging from characterizing the hierarchical growth of galaxies and their 
central BHs across the history of the Universe, to testing the strong-gravity limit 
of general relativity in MBH mergers; from mapping the Kerr geometry of spacetime
around MBHs, to surveying the demographics of compact binaries in our Galaxy; and 
much more. (The LISA science-case document [2] is a very good place to start learning 
and to find extensive references about the science that will be enabled by LISA.) In 
fact, LISA data analysis should be construed as an integral part of its measurement 
concept, since LISA's true science data will be the counts and parameters of the GW 
sources it detects.

The Mock LISA Data Challenges (MLDCs) [4-8], begun in 2006, are a
community-wide effort to demonstrate and advance LISA data-analysis capabilities,
to understand its conceptual and quantitative peculiarities, and to kickstart the
development of algorithms and software. In each round of the MLDCs, challenge
datasets are released that contain GW signals from sources of undisclosed parameters, 
embedded in synthetic LISA noise; challenge participants have a few months to analyze 
the data and submit detection candidates, which are then compared with the sources 
originally injected in the datasets. (Training datasets with public source parameters
are also provided, to help participants tune and troubleshoot their searches.) 

At the time of this symposium, three MLDC rounds have been completed: MLDC 
1 [4, 5] focused on establishing basic techniques to observe GWs from compact Galactic 
binaries, isolated or moderately interfering; as well as from the inspiral phase of bright 
and isolated nonspinning MBH binaries. The more ambitious MLDC 2 [6,7] featured 
a dataset containing GW signals from approximately 26 million Galactic binaries
generated with a population-synthesis code; a dataset (the "whole enchilada") with
a similar population, plus GW signals from a small number of nonspinning MBH
binaries and EMRIs; and five more datasets with single-EMRI signals. MLDC 1B [8]
was a repeat of MLDC 1 (with the addition of single-EMRI datasets), meant as a 
second entry point to the MLDCs for new, less experienced participants. Last, MLDC 
3 [8], now in progress, marks the inclusion of more realistic source models (such as 
chirping Galactic binaries and spinning MBH binaries) and of new types of sources 
(such as short-lived bursts and stochastic backgrounds). 

This article is a brief introduction to the ideas and techniques of LISA data 
analysis, as exemplified by the published reports authored by MLDC participants 
about their work. Collectively, those reports illuminate a broad swath of the state of 
the art in this field. More informal discussions are also available in the form of the 
technical notes submitted with MLDC entries [9]. I shall focus on the methods, rather 
than their quantitative results; for those, see [5,7-9], and the MLDC articles cited 
below. Furthermore, I shall concentrate on the analysis of LISA data, as opposed to 
the equally crucial modeling of LISA sources. The approach taken in the MLDCs is 
that the development of data-analysis algorithms can proceed in parallel to source- 
modeling research, by assuming that currently available waveforms and source models 
can be used as stand-ins for more accurate versions that will be available by the time 
LISA returns data. 

In the rest of this paper, I briefly describe the peculiarities of LISA data 
analysis compared to ground-based observatories (section 2), and I discuss the 
plans to detect and characterize the three classes of sources that have so far been
the focus of the MLDCs: SMBHs (section 3), EMRIs (section 4), and Galactic 
binaries (section 5). Thus, I do not cover the recent progress toward other LISA 
data-analysis objectives, such as detecting bursts from cosmic strings [10] and 
characterizing stochastic backgrounds (see [11] for the basic detection principles, [12] 
for a simple implementation, and [13] for the prospects of constraining anisotropy and 
polarization). 

2. The LISA dataset 

A good way to begin to understand the peculiarities of LISA data analysis is to contrast 
the LISA dataset with those of ground-based experiments such as LIGO [14].

The dataset is dominated by GW signals, whereas ground-based detector data are
dominated by instrument noise. Thus individual LISA sources must be detected 
over the background of many others, which raises issues of confusion between similar 
signals, and complicates the attribution of statistical significance to detections. 

The LISA dataset is small: $5 \times 10^8$ samples (1 Hz bandwidth $\times$ 5 yr operation $\times$
3 independent GW observables), to be compared with $\sim 4 \times 10^{11}$ samples for the
one-year LIGO S5 run alone. The difference originates mainly in the $10^3 : 1$ ratio
of bandwidths, and it points to a different character of data analysis: a fine-tuning
job of fitting many source parameters to relatively few bits for LISA, rather than a
needle-in-the-haystack data-mining problem for ground-based detectors.
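
As a quick check of the sample count quoted above, the arithmetic can be reproduced in a few lines. This is only a sketch: it assumes one sample per second for each observable and a Julian year of 365.25 days, neither of which is specified in the text, and the function name is hypothetical.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 3600  # Julian year (an assumption)

def lisa_sample_count(rate_hz=1.0, years=5.0, observables=3):
    """Total number of samples: sampling rate x duration x number of observables."""
    return rate_hz * years * SECONDS_PER_YEAR * observables

print(f"{lisa_sample_count():.2e}")
```

Under these assumptions the count comes to about 4.7 x 10^8, which rounds to the figure quoted in the text.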

The LISA response is complex, and it is key to LISA's directional sensitivity. For
both LISA and ground-based detectors, incoming GW signals are periodically Doppler 
shifted by the motion of the detector around the Sun, and modulated by the rotation 
of its sensitive axes. Because most LISA sources are long lasting, these effects imprint 
GWs with information about the sky location and orientation of sources. By contrast, 
sources for ground-based detectors are short lived (with the exception of pulsars), so 
sensitivity to source position is provided by the triangulation of times of arrival in a 
detector network.‡ Thus, the signal models used in LIGO searches do not generally
include sky position as a parameter, while the signal models for LISA must. 

One LISA is equivalent to a network of three synthesized GW interferometers. To 
cancel the otherwise overwhelming laser frequency noise, the phase measurements 
taken between LISA spacecraft must be carefully assembled into the observables of 
time-delay interferometry (TDI; see [15] for an introduction), which complicates the 
LISA response even more. Many different observables can be formed; however, when 
all six laser links are operational, LISA can be described as producing two independent, 
GW-sensitive measurements for frequencies below $2 \times 10^{-3}$ Hz, and three above.§

3. Supermassive black-hole binaries 

Observations of galactic nuclei from the ground and space suggest that SMBHs reside 
at the center of most galaxies. Hierarchical structure formation models explain SMBH 
growth as the result of the mergers of central BHs in merged galaxies [16]; we observe 
many of the latter. LISA is expected to detect GWs from a few to a hundred SMBH 
inspirals per year, with masses between $10^4$ and a few $10^7\,M_\odot$, and from redshifts as
far as z ~ 20 [17]. 

Signal modeling. Binary-coalescence waveforms have been customarily divided in the 
phases of inspiral, merger, and ringdown; rather than a true physical distinction, 
however, this division has echoed a specialization of effort in the GR community, and 
the use of different mathematical techniques to compute the corresponding pieces of 
waveforms (viz., post-Newtonian equations, the direct numerical integration of the 
Einstein equations, and perturbation methods on BH backgrounds). 

‡ Triangulation is relevant to LISA in the higher-frequency part of its sensitive band, where the GW
wavelength is comparable to the LISA armlengths, so the LISA output is complicated by significant
phase delays between the response of the three spacecraft to the waves.

§ A simple way to think about this is that three LIGO-like interferometers can be formed with the
three LISA spacecraft, but at long wavelengths the sum of the three signals must be zero by symmetry,
which removes one independent measurement.

As for data analysis, searches on data from ground-based experiments have so far
targeted inspiral and ringdown waveforms (from binaries of BHs, as well as other stellar
remnants; see, e.g., [18,19]), because theoretical models for these partial waveforms
have been easily available as analytical functions of the source parameters. Numerical 
relativity's recent success in robustly computing several GW cycles from a variety of 
merger configurations (see [20] for the breakthrough results, and [21] for a review) has 
rekindled the interest in implementing inspiral-merger-ringdown searches for ground- 
based experiments. Given the substantial computational cost of merger waveforms, 
these searches rely on analytical waveform families obtained by fitting to "stitched" 
post-Newtonian and numerical waveforms, computed for a small number of parameter 
sets [22]. The collaboration between numerical relativists and GW data analysts 
has recently become tighter thanks to the Numerical INJection Analysis (NINJA) 
project [23], whereby numerical waveforms produced by ten different codes have been 
injected in synthetic instrument noise, and then searched for using the LIGO-Virgo
Collaboration inspiral and burst pipelines, as well as more heterodox techniques such 
as the Hilbert-Huang transform. 

The experience accrued in ground-based searches for BH binaries will transfer
directly to LISA data analysis, not least because waveforms for binaries in the LISA
frequency range, with total masses $M \sim 10^4$-$10^8\,M_\odot$, are time-rescaled versions of
the waveforms for stellar-mass binaries in the ground-based range (this is because $M$
is effectively the only timescale in the problem, while all other source parameters are
or can be made dimensionless). However, additional work will certainly be needed
on the source-modeling side, because waveform-accuracy requirements are much less 
stringent for ground-based detectors than for LISA: the former will yield detections 
at modest signal-to-noise ratios (SNR) that limit the resolving power of searches and 
therefore increase tolerable waveform errors; the latter will detect SMBH binaries with 
much higher SNRs, so accurate waveforms (including the effects of orbital eccentricity 
[24] and spin interactions and waveform corrections at higher post-Newtonian order 
[25]) will be required to estimate source parameters reliably [26] and to avoid biasing 
the detection of other weaker sources in the LISA data. 

Matched filtering. The possibility of computing accurate waveforms for BH binaries 
as functions of a small number of parameters, as well as the phase coherence of 
these signals across their entire duration, make them ideal targets for detection by 
matched filtering (see, e.g., [27]). This is the gold standard of GW data analysis, 
in which a set of theoretical waveform models (templates), computed for source- 
parameter values spread across their likely ranges, are individually subtracted from the 
measured detector output, and the statistical likelihood of the residuals is evaluated 
with respect to a model of detector noise, assumed additive. If the noise is Gaussian 
and stationary, the likelihood is a function of a single number, the SNR after filtering 
$\rho(h_i) = (s, h_i) / \sqrt{(h_i, h_i)}$: here $s$ is the noisy signal, $h_i$ is a template, and the inner
product

$$(g, h) = 4\,\mathrm{Re} \int_0^{+\infty} \frac{\tilde{g}^*(f)\,\tilde{h}(f)}{S_n(f)}\,df \qquad (1)$$

is a generalization of a time-domain correlation product. Equation (1) is weighted by
the noise's one-sided power spectral density $S_n(f)$ toward the frequencies at which
the detector is most sensitive. A detection can be claimed for template $h_i$ when $\rho(h_i)$
is larger than a threshold $\rho^*$ chosen to yield a negligible rate of false alarms. In fact,
GW signals will "ring" several templates in a neighborhood of parameter space, so
the detection is attributed to the template with the maximum SNR.
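
A minimal numerical sketch of the matched-filtering SNR described above may help fix ideas. It assumes white Gaussian noise and a sinusoidal template, both chosen here only for illustration; the function names are hypothetical, and the noise-weighted inner product is approximated by a sum over FFT bins.

```python
import numpy as np

def inner_product(g, h, psd, dt):
    """Discrete approximation of (g, h) = 4 Re int g~*(f) h~(f) / S_n(f) df."""
    n = len(g)
    df = 1.0 / (n * dt)
    g_f = np.fft.rfft(g) * dt   # approximate continuous Fourier transforms
    h_f = np.fft.rfft(h) * dt
    # Skip the zero-frequency bin, which carries no signal in this toy problem.
    integrand = np.conj(g_f[1:]) * h_f[1:] / psd[1:]
    return 4.0 * np.real(np.sum(integrand)) * df

def matched_filter_snr(s, h, psd, dt):
    """SNR rho(h) = (s, h) / sqrt((h, h)) of data s against template h."""
    return inner_product(s, h, psd, dt) / np.sqrt(inner_product(h, h, psd, dt))

# Toy data: white Gaussian noise of unit variance plus a sinusoid.
rng = np.random.default_rng(0)
n, dt, sigma = 4096, 1.0, 1.0
t = np.arange(n) * dt
psd = np.full(n // 2 + 1, 2.0 * sigma**2 * dt)   # one-sided PSD of white noise
amplitude = 20.0 * np.sqrt(2.0 / n)              # chosen so that sqrt((h, h)) = 20
template = amplitude * np.sin(2 * np.pi * (100 / (n * dt)) * t)
data = template + sigma * rng.standard_normal(n)

print(matched_filter_snr(data, template, psd, dt))
```

For a perfectly matching template the recovered SNR fluctuates around the optimal value $\sqrt{(h, h)}$ with unit variance, which is why a threshold well above unity is needed to keep false alarms rare.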

In the practice of ground-based searches, detector noise can be glitchy and
nonstationary, so additional detection criteria are introduced: coincidence between
different detectors, data-quality vetoes that exclude noisy or problematic stretches of
data, and signal-based vetoes such as $\chi^2$, which compares the contribution of different
frequency bands to $\rho$. Moreover, the statistical value of indicators such as $\rho$ is not
assumed a priori from noise models, but it is derived from a frequentist analysis 
of backgrounds (how often do high-SNR candidates result from noise alone?) and 
detection efficiencies (how often does the search pipeline recover GW signals artificially 
injected into the data?). It is unclear to what extent such cautions will be needed in 
LISA searches: while they are certainly less important for high-SNR detections, little is 
known at this time about the Gaussianity and stationarity of the LISA noise, and a full 
understanding may be possible only at the time of commissioning. The availability 
of multiple TDI observables will certainly play a role in the eventual solution [28]. 
From the viewpoint of LISA data-analysis development, it is therefore reasonable to 
concentrate early efforts on Gaussian-noise searches, which in any case make up the 
basic framework and dictionary for more careful statistical studies. 

Template placement. The most important decision that must be made in implementing 
a matched-filtering search is how to distribute the templates to be tested across source- 
parameter space. Ground-based searches for GWs from nonspinning compact-object 
binaries have so far placed templates along structured grids (rectangular, or hexagonal) 
in the plane of the two binary masses $m_1$ and $m_2$. The grids are set densely enough
that, for any true GW signal present in the data, the nearest template would still
recover a large percentage (the minimum match) of the optimal SNR achieved by a
perfectly matching template. In practice, the placement is performed by finding new
coordinates $c_1(m_1, m_2)$ and $c_2(m_1, m_2)$ such that a regular grid in $c_1$ and $c_2$ achieves
the minimum-match constraint approximately homogeneously across the mass plane.

Building such reassuringly exhaustive template banks has been possible for signal
models that depend only on two parameters. (Technically there are a few more, such
as the strength, time of occurrence, and initial phase of the inspiral waves, but $\rho$ can
be maximized analytically over these extrinsic parameters without need for distinct
templates to be placed along their ranges.) For a larger number of intrinsic parameters,
producing homogeneous banks becomes complicated, and the curse of dimensionality 
leads to impractically numerous banks, because the size of a regular lattice grows 
exponentially with the dimension of space. This will generally be the case in LISA 
searches of GWs from binaries, since the observed waveforms will be functions of the 
binary's masses and spin vectors, and of its sky position and orientation. 
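
The exponential growth mentioned above is easy to make concrete. The sketch below assumes, purely for illustration, a regular lattice with 100 grid points along each parameter axis; the figure of 100 and the function name are not taken from any actual search.

```python
def lattice_size(points_per_dimension, n_dimensions):
    """Number of templates in a regular lattice with equal sampling along each axis."""
    return points_per_dimension ** n_dimensions

# Two mass parameters versus larger sets that also include spins and sky position.
for n_dimensions in (2, 4, 8):
    print(n_dimensions, lattice_size(100, n_dimensions))
```

With these illustrative numbers, going from two to eight intrinsic parameters takes the bank from ten thousand templates to 10^16.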

Stochastic searches. A possibility, adopted in LIGO's S3 search for spinning binaries 
[19], is to employ stochastic banks of randomly placed templates [29]. Another 
attractive strategy that avoids banks altogether is to generate one or more stochastic 
paths that roam about parameter space, visiting different parameter sets with 
probabilities proportional to the Bayesian posterior probability of a true signal having 
those parameters. This can be achieved with Markov-Chain Monte Carlo (MCMC) 
schemes|| [31]: at every step (sample) along the path (chain), a proposal is made for 
the next location to visit by drawing from a distribution that is a function only of 
the current location; the proposal is then accepted with a probability proportional
to the ratio of posteriors for the proposed and current locations. MCMCs are in
fact integration schemes that yield the moments of the posterior, and therefore the
uncertainties in the determination of source parameters. Their convergence varies
from run to run, and depends crucially on the choice of proposal distributions, but
roughly speaking the schemes converge as one over the square root of the number of
independent locations; since nearby samples along the path are not independent, the
final computations can be made on a subset of samples separated by the correlation
time of the chain, usually several tens of steps.

|| The literature on the application of MCMCs to GW detection is too vast to cite here. To my
knowledge, the first contribution on this subject is [30], in the context of enabling Bayesian parameter
inference after the detection of binary inspirals.

A LISA Data-Analysis Primer

Typical MCMC runs for GW parameter-estimation problems visit millions of 
parameter sets [32]; however, if the focus shifts to using MCMCs as searches (that 
is, to locating the maximum of the posterior), they can be made faster by factors 
~100 by deploying a vast array of tricks [33] that accelerate the motion of the chain
toward the maximum.¶ Among these, annealing consists in progressively increasing
the contrast of the target distribution, starting with a "hot" chain that can take bold 
steps across parameter space, and then cooling it on a preset schedule. In so-called 
frequency annealing, the search starts off with shorter signal templates, which are 
faster to compute and again reduce contrast; templates are then extended as the 
search begins to converge. In so-called thermostated heating, the dynamic range of the 
posterior is compressed at its high end, to avoid getting stuck on secondary maxima. 
Another useful strategy is reducing the number of parameters by maximizing the
posterior analytically over the extrinsic parameters with the $\mathcal{F}$ statistic [34,35].
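
The basic Metropolis step and the temperature-annealing trick can be sketched as follows. This is a toy illustration, not the code of any MLDC entry: the one-dimensional bimodal `log_posterior`, the linear cooling schedule, and the function names are all assumptions made here. With a symmetric Gaussian proposal, a jump is accepted with probability min(1, r^(1/T)), where r is the posterior ratio and T the current temperature.

```python
import numpy as np

def log_posterior(x):
    """Hypothetical bimodal target: secondary peak at -3, primary peak at +3."""
    return np.logaddexp(-0.5 * ((x + 3.0) / 0.3) ** 2 - 2.0,
                        -0.5 * ((x - 3.0) / 0.3) ** 2)

def metropolis(log_post, x0, temperatures, step, rng):
    """Metropolis chain with one step per entry of the temperature schedule."""
    x, lp = x0, log_post(x0)
    chain = np.empty(len(temperatures))
    for i, temperature in enumerate(temperatures):
        proposal = x + step * rng.standard_normal()   # depends only on current x
        lp_new = log_post(proposal)
        # Accept with probability min(1, (posterior ratio)^(1/T)).
        if np.log(rng.uniform()) < (lp_new - lp) / temperature:
            x, lp = proposal, lp_new
        chain[i] = x
    return chain

rng = np.random.default_rng(1)
schedule = np.linspace(50.0, 1.0, 20000)   # "hot" start, cooled linearly to T = 1
chain = metropolis(log_posterior, x0=-3.0, temperatures=schedule, step=1.0, rng=rng)
print(chain[-1])
```

Passing a schedule of ones recovers a plain Metropolis sampler, whose output can be thinned (for example `chain[::50]`) to obtain approximately independent samples. A chain run with a nontrivial schedule is a search tool only, and its samples should not be read as draws from the posterior.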

As mentioned above, the choice of the proposal distribution is paramount, 
especially because there can be significant correlations among the changes in the 
waveforms due to changing different source parameters $\theta^i$. A basic technique (helpful
also for parameter estimation) is rescaling local jumps by the eigenvectors and
eigenvalues of the Fisher matrix $(\partial h/\partial\theta^i, \partial h/\partial\theta^j)$; strictly speaking, this matrix
approximates the curvature of the likelihood surface around a maximum, but here 
it is used to identify the correct scale of parameter displacements with respect to 
waveform changes, and to disentangle parameter correlations. The proposals include 
also long speculative jumps across the entire allowable parameter ranges, and targeted 
jumps that exploit the nonlocal symmetries of the waveforms (e.g., changing the chirp 
mass and time of coalescence together to move from the primary posterior maximum 
to secondary unconnected maxima) or of the LISA response (e.g., moving to antipodal 
points in the sky, which produce essentially the same LISA response at low frequencies) 
[33]. MCMC schemes have been applied successfully to searching for SMBH binary 
signals in MLDC 1 [36,37], 2 [7], and 1B [8].
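
A minimal sketch of Fisher-matrix-scaled local jumps follows. The 2 x 2 matrix is made up to mimic two strongly correlated parameters, and `fisher_jump` is a hypothetical helper; in a real search the matrix would be computed from waveform derivatives at the current location.

```python
import numpy as np

def fisher_jump(fisher, rng):
    """Draw a jump whose covariance equals the inverse of the Fisher matrix."""
    eigenvalues, eigenvectors = np.linalg.eigh(fisher)
    # Scale a unit Gaussian draw by 1/sqrt(eigenvalue) along each eigenvector:
    # short steps along well-measured directions, long steps along poorly
    # measured ones, automatically following parameter correlations.
    scales = rng.standard_normal(len(eigenvalues)) / np.sqrt(eigenvalues)
    return eigenvectors @ scales

fisher = np.array([[100.0, 90.0], [90.0, 100.0]])   # made-up, strongly correlated
rng = np.random.default_rng(4)
theta = np.array([1.0, 2.0])                        # hypothetical current location
proposal = theta + fisher_jump(fisher, rng)
print(proposal)
```

Because the jump distribution is symmetric around the current location, it can be used directly in a Metropolis acceptance test as long as the matrix is held fixed between updates.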

The fact that MCMC searches are so efficient in locating the maximum of the 
posterior indicates that it is not found on a narrow, isolated peak, but rather on a larger 
mountain that can be climbed semirandomly by the chain. It should be then possible 
to exploit such a structure to organize a template-bank-based search of efficiency 
comparable to MCMCs, using hierarchical banks that become increasingly dense and 
concentrated in promising regions, or that are laid out alternatingly along separate 
subsets of parameters.+ Indeed, the authors of [38] implemented both strategies
(as well as stochastic template placement) for their MLDC 1B entry, and succeeded
in limiting template evaluations to a few $10^4$, as were needed in MCMC searches;
[39] achieves a similar result using sophisticated stochastic template placement with
location-dependent densities. As mentioned above, bank-based searches have a
reassuring value in searching exhaustively over the entire parameter space; it will
be interesting to see if they can be extended to spinning-binary signals, which bring in
several new parameters. We note also that bank-based searches do benefit from a final
MCMC refinement of the best-fit parameters (see also [37] for an MLDC 1 composite
search that joins TF track fitting, a bank-based search, and a final MCMC).

¶ Most of these tricks do not, as is often stated incorrectly, remove the Markov property of the chain
(its memorylessness); rather, they destroy the asymptotic convergence of the chain to the target
posterior distribution.

+ Conversely, if the maximum were really on an isolated peak, MCMC and bank methods alike would
require a number of templates on the order of the ratio between the volume of parameter space and
the footprint of the peak.

4. Extreme mass-ratio inspirals 

EMRIs involve compact stellar objects in galactic nuclei, which have entered a region 
of phase space where their further evolution is dominated by GW emission; thus 
EMRIs are by definition doomed to inspiral and plunge into the central galactic SMBH. 
The requirement that EMRIs be detectable by LISA constrains the SMBH mass to 
$10^4$-$10^7\,M_\odot$, and the nature of the compact object to a neutron star, white dwarf, or
stellar-mass BH, which can approach the SMBH closely without being disrupted. See 
[40] for a review of EMRI astrophysics. 

EMRIs will typically emit GWs in the LISA band for $\lesssim 10^5$ cycles, over times
comparable to the LISA lifetime, and with signal strengths an order of magnitude 
below the LISA noise, and a factor of a few deeper below the Galactic-binary 
foreground. EMRI signals can be understood as the waveforms emitted by test 
particles moving on geodesics and creating small perturbations ($\propto$ the mass ratio)
in the Kerr geometry of the central spinning SMBH. The perturbations travel out to 
infinity as GWs, but they also scatter off the background geometry and act back on the 
test particle, which is slightly deviated off its geodesic: this is the "self-force" picture 
of EMRI motion. Indeed, it is BH perturbation theory that enables the computation of 
EMRI orbits and waveforms: by contrast, EMRI orbital velocities are too relativistic 
for post-Newtonian calculations to be applicable, whereas the large number of orbits 
makes numerical-relativity runs unaffordable. 

EMRI waveforms. See Drasco [41] for a taxonomy of the approaches that have been 
developed to compute EMRI waveforms, from the most accurate (but not yet achieved) 
self-force waveforms, to Barack and Cutler's "analytical kludge" waveforms [42], which
enhance Keplerian orbits (and the corresponding Peters-Mathews GW emission [43]) 
by adding post-Newtonian pericenter- and orbital-plane-precession terms. These 
waveforms capture the qualitative behavior of EMRI waveforms, such as the GW 
emission at multiple harmonics of the radial frequency modified by the frequency of 
periastron precession, each with sidebands at multiples of the frequency of orbital- 
plane precession. While parameter estimation may eventually require the waveforms 
on the more accurate end of Drasco's classification, recent investigations of EMRI 
detection (including those in MLDC 2 and 1B) have used analytic-kludge waveforms
as stand-ins, because they can be generated much more rapidly, and they are more 
transparent to insight on the effect of the source parameters on the evolution of 
frequencies. 

Semicoherent searches. The first quantitative investigation of EMRI searches [44] 
concluded that, because of the large number of source parameters (fourteen, of which 
perhaps seven are especially important), and of the large number of GW cycles that
must be matched coherently in a matched-filtering search, it would be infeasible
to deploy a template bank with all the required signal shapes. The authors of 
[44] suggested instead a semicoherent search whereby bank-based matched filtering 
would be applied on shorter data segments of three weeks, and the resulting SNRs 
summed incoherently along paths in parameter space corresponding to the expected 
evolution of EMRIs along the inspiral. This strategy is analogous to the stack-slide 
method developed for all-sky searches of periodic sources with LIGO [45]. Template 
banks would still be very large ($10^{12}$), and the computational load very taxing (many
teraflops yr). The reach (maximum detection distance) of such a search would be
roughly half the reach of optimal filtering, but would still allow tens to hundreds of 
detections. 
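
The idea of summing segment SNRs incoherently can be illustrated with a toy problem. This is not the search of [44]: it assumes white noise, a sinusoid whose phase jumps randomly from one segment to the next (a crude stand-in for phase errors that accumulate over a long inspiral), and a single template frequency instead of a bank. All names are hypothetical.

```python
import numpy as np

def segment_power(segment, f0, dt, sigma):
    """Phase-maximized matched-filter power for a sinusoid in white noise."""
    t = np.arange(len(segment)) * dt
    cos_t, sin_t = np.cos(2 * np.pi * f0 * t), np.sin(2 * np.pi * f0 * t)
    # Each quadrature SNR is (s, h) / sqrt((h, h)) for noise of variance sigma^2.
    rho_c = segment @ cos_t / (sigma * np.sqrt(cos_t @ cos_t))
    rho_s = segment @ sin_t / (sigma * np.sqrt(sin_t @ sin_t))
    return rho_c**2 + rho_s**2

def semicoherent_statistic(data, n_segments, f0, dt, sigma):
    """Incoherent sum of the coherent power recovered in each data segment."""
    return sum(segment_power(segment, f0, dt, sigma)
               for segment in np.array_split(data, n_segments))

rng = np.random.default_rng(3)
n_segments, segment_length, dt, sigma, f0 = 20, 512, 1.0, 1.0, 32 / 512
t_local = np.arange(segment_length) * dt
amplitude = 5.0 * np.sqrt(2.0 / segment_length)   # optimal SNR of 5 per segment
phases = rng.uniform(0.0, 2.0 * np.pi, n_segments)  # unknown to the search
signal = np.concatenate([amplitude * np.sin(2 * np.pi * f0 * t_local + phase)
                         for phase in phases])
data = signal + sigma * rng.standard_normal(signal.size)

coherent = segment_power(data, f0, dt, sigma)     # one template over all the data
semicoherent = semicoherent_statistic(data, n_segments, f0, dt, sigma)
print(coherent, semicoherent)
```

In this toy setting the fully coherent statistic is degraded because the template loses phase with the signal between segments, while the semicoherent sum retains the power found in each segment at the price of a larger noise contribution.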

Time-frequency searches. More efficient if less powerful searches can be carried out by 
looking for the tracks left by EMRI signals in the time-frequency (TF) spectrograms 
of LISA data. Such a program has been developed over the last few years by Wen, 
Gair, and coauthors, who began by looking for single bright pixels in the TF plane [46] 
(in analogy to the excess power method for ground-based searches [47]), using multiple
choices of pixel timespan and frequency bandwidth. Such a search has roughly half the 
reach of the semicoherent strategy, and it is clearly limited by its focus on the brightest 
single-harmonic, quasimonochromatic stretch of EMRI signal. This limit is overcome 
by an extension of the method, the Hierarchical Algorithm for Clusters and Ridges 
[48], which groups nearby bright pixels in clusters, much like the TFClusters algorithm 
[49] used in LIGO burst searches, enhancing the reach of the single-pixel method by up 
to 20%. In addition, the shape of the clusters carries much useful information, which 
can be used [50] to estimate the intrinsic parameters of the EMRI (from the evolution 
of the frequencies and the separation of sidebands) and its sky position (from the 
modulations in the strength of the track in the different TDI observables throughout 
the inspiral). The extended method was applied to the EMRI Challenges in MLDC 2
and 1B [50], where it was able to estimate all intrinsic parameters (and for MLDC 1B,
also the sky position) to a few percent, and much better for the fundamental radial 
frequency. 
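
The single-pixel stage of a TF search can be sketched with a plain spectrogram. The example below is a toy: it assumes white noise, a slowly chirping sinusoid standing in for one EMRI harmonic, non-overlapping rectangular segments, and a threshold picked by hand; it does not implement the clustering step of [48].

```python
import numpy as np

def spectrogram(data, segment_length):
    """Power in non-overlapping segments; rows are time bins, columns frequency bins."""
    n_segments = len(data) // segment_length
    segments = data[:n_segments * segment_length].reshape(n_segments, segment_length)
    return np.abs(np.fft.rfft(segments, axis=1)) ** 2

rng = np.random.default_rng(2)
n, dt, sigma, segment_length = 65536, 1.0, 1.0, 1024
t = np.arange(n) * dt
f_start, f_end = 0.10, 0.12                       # linear frequency drift (toy values)
phase = 2 * np.pi * (f_start * t + 0.5 * (f_end - f_start) / (n * dt) * t**2)
data = np.sin(phase) + sigma * rng.standard_normal(n)

# Normalize by the expected white-noise power per pixel, segment_length * sigma^2.
power = spectrogram(data, segment_length) / (segment_length * sigma**2)
frequencies = np.fft.rfftfreq(segment_length, d=dt)
bright = power > 25.0                             # hand-picked threshold
track = frequencies[np.argmax(power, axis=1)]     # brightest pixel in each time bin
print(track[0], track[-1])
```

The sequence of brightest pixels traces the frequency evolution of the signal, which is the information that track-fitting methods then convert into estimates of the source parameters.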

Altogether, TF searches are very fast (they run in minutes); they are applicable 
also to other bright LISA sources, including SMBH and Galactic binaries [48]; and 
they may serve as a first stage in a hierarchical search, segueing into targeted matched 
filtering. However, all explorations so far have been for isolated sources in Gaussian 
LISA noise, and there is a question of how performance may be reduced in the 
realistic case, with multiple intersecting tracks from instrument glitches and other 
strong sources (SMBH binary inspirals, the brightest Galactic binaries, and multiple 
simultaneous EMRIs). Even if the track-identification problem proves intractable in
general, the brightest sources will still stand out above the confusion. 

Stochastic searches. The high dimension of EMRI parameter space calls naturally 
for a stochastic approach to matched filtering. Indeed, MCMC codes developed to 
search for SMBH binaries in MLDC data have been adapted for EMRI waveforms 
[51,52] (see also the earlier example [53]). The main problem in the adaptation is to
secure a sufficiently broad sampling of parameter space, so that the chain is given a
chance to wander sufficiently close to the true posterior maximum, and at the same
time to avoid getting stuck on a secondary maximum; there are as many of these as there
are ways for the multiple TF tracks of two EMRI signals to find themselves in partial
constructive interference for widely separated parameter sets.
To this purpose, the authors of [51,52] have again employed tricks ranging
from temperature annealing to time annealing (equivalent to frequency annealing 
for SMBHs) and thermostated heating. Multiple proposal distributions are used, 
including long speculative jumps followed by a reconnaissance search in the 
neighborhood of the landing site before deciding whether to accept the jump [52]; 
jumps limited to intrinsic, extrinsic, or phase parameters [51]; and "island hopping," 
constrained jumps designed to maintain gross waveform features such as track 
geometry. The implementation of this feature in [51] is especially interesting: strong 
harmonic features are first identified with small MCMC runs; the subsequent search is 
constrained to parameter sets that would reproduce them, using parameter relations 
valid for analytic-kludge waveforms. 

Trials on MLDC datasets [8, 51, 52] have shown that MCMC searches are capable 
of detecting moderately strong, isolated signals with good parameter-estimation 
performance. Much as for TF searches, it remains to be seen which of the strategies 
discussed here will be effective for weaker EMRI signals, and in the presence of 
confusion noise, of other LISA sources, and of other EMRIs. (MLDC 3 does include 
a search for five simultaneous, weaker EMRIs.) As in the case of SMBH searches, 
convergence-acceleration techniques are incompatible with the estimation of the full 
posterior, so subsequent, longer runs without tricks will be needed. If the mountains 
around the secondary maxima hold significant probability, they must be included 
in parameter uncertainties in addition to the width of the global maximum, so we 
will need to ensure that the parameter-estimation MCMCs visit them fairly. It is 
likely that some sort of island hopping will be needed for this purpose; thus, it is 
important to develop an understanding of the (approximate) discrete symmetries of 
EMRI parameter space that conserve the properties of EMRI signals. See [54] for the 
beginnings of such an understanding. 

5. Galactic binaries 

The dominant GW contribution in the LISA band will come from short-period binaries 
of compact stellar objects (white dwarfs, neutron stars, black holes, and naked 
He stars), predominantly from the tens of millions of such systems in our Galaxy 
[55,56], but also from farther afield [57]. We know from astronomical observations 
that a few known Galactic binaries are sufficiently strong to be detected by LISA 
with the matched-filtering SNR accreted in a few months after commissioning [58]; 
these verification binaries will provide the first test of the correct operation of the 
instrument. 

Throughout its lifespan, LISA will be able to detect and characterize a few 10^4 
additional unknown resolvable binaries [59], but most systems will be either too 
weak or too crowded to be detected individually, and will form a confusion-limited 
foreground that will act as effective noise for the detection of other sources. Confusion 
will set in below roughly 3 mHz, where the binary density reaches 0.5 per FFT 
bin [55,60]. Thus, most resolved binaries will be found above 3 mHz, with some at 
lower frequencies if they are sufficiently massive or close to us that they stand out 
from the crowd. After removing these, the confusion foreground is left with statistics 
very similar to those of colored Gaussian noise [59], with some imprints of having been 
emitted from the direction of the Galaxy [61]. 
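
The criterion of 0.5 binaries per FFT bin can be made concrete with a short sketch. It assumes that an FFT bin has width 1/T_obs for an observation of duration T_obs, and it takes a plain list of source frequencies as input; the function names and the one-year default are illustrative choices, not part of any MLDC code.

```python
YEAR = 365.25 * 86400.0  # one Julian year in seconds


def fft_bin_width(t_obs):
    """Frequency resolution (Hz) of an observation lasting t_obs seconds."""
    return 1.0 / t_obs


def sources_per_bin(freqs, f_lo, f_hi, t_obs):
    """Mean number of sources per FFT bin in the band [f_lo, f_hi)."""
    n_bins = (f_hi - f_lo) / fft_bin_width(t_obs)
    n_src = sum(1 for f in freqs if f_lo <= f < f_hi)
    return n_src / n_bins


def is_confused(freqs, f_lo, f_hi, t_obs, threshold=0.5):
    """True if the band meets or exceeds the assumed confusion threshold."""
    return sources_per_bin(freqs, f_lo, f_hi, t_obs) >= threshold
```

For one year of data the bin width is about 32 nHz, so a band 0.1 mHz wide holds roughly 3,000 bins and reaches the threshold once it contains about 1,600 binaries.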

Matched filtering. Compared to SMBHs and EMRIs, Galactic-binary signals are 
simple: by the time they enter the LISA band, orbits have overwhelmingly become 
circularized, and frequency evolution is slow enough that it can be matched accurately 
by a linear model. However, the resulting quasimonochromatic signals are modulated 
by the LISA response: the LISA motion around the Sun smears the central frequency f 
into a bandwidth of 300 x (f/3 mHz) nHz, and the rotation of the LISA triangle creates 
eight sidebands, displaced symmetrically by frequency multiples of 1/year ~ 32 nHz 
(see, e.g., [35]). Thus, simple spectral analysis is insufficient, and matched filtering 
again comes to the fore. The F-statistic can be used to maximize the likelihood 
analytically over the extrinsic parameters, so that templates can be distributed only 
over the remaining three (f and the sky-position angles), plus df/dt if f > 1.6 mHz. 
Indeed, Prix, Whelan, and Khurana ran such searches on MLDC 1, 2, and 1B 
data [9,62], adapting software from the LIGO searches for continuous GWs, and 
implementing a sophisticated pipeline that includes a first wide-parameter F-statistic 
search run separately on multiple TDI channels, a coincidence sieve, and a final 
zoomed-in search that is run in parallel over the TDI channels. 
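
The idea behind the F-statistic, analytic maximization of the likelihood over parameters that enter the signal linearly, can be sketched on a toy problem. The sketch below is not the pipeline of [9,62]: it assumes a pure sinusoid in white Gaussian noise of known standard deviation `sigma`, unit sampling interval, and no LISA response, so that only two amplitudes (the cosine and sine quadratures) are maximized analytically instead of the full set of extrinsic parameters. The template bank then runs over frequency alone.

```python
import math
import random


def f_statistic(data, freq, sigma):
    """Return (F, a_cos, a_sin) for a sinusoid of frequency freq in white noise.

    F is the log-likelihood ratio maximized analytically over the two
    quadrature amplitudes; a_cos and a_sin are the maximizing amplitudes.
    """
    x1 = x2 = m11 = m12 = m22 = 0.0
    for k, d in enumerate(data):
        c = math.cos(2.0 * math.pi * freq * k)
        s = math.sin(2.0 * math.pi * freq * k)
        x1 += d * c          # inner products of the data with the templates
        x2 += d * s
        m11 += c * c         # 2x2 matrix of template inner products
        m12 += c * s
        m22 += s * s
    det = m11 * m22 - m12 * m12
    a1 = (m22 * x1 - m12 * x2) / det
    a2 = (m11 * x2 - m12 * x1) / det
    return 0.5 * (x1 * a1 + x2 * a2) / sigma ** 2, a1, a2


def search(data, freqs, sigma):
    """Scan a frequency bank and return (F, a_cos, a_sin, freq) at the maximum."""
    results = (f_statistic(data, f, sigma) + (f,) for f in freqs)
    return max(results, key=lambda r: r[0])


def toy_data(n, freq, amp, phase, sigma, seed=1):
    """Hypothetical test signal: one sinusoid plus white Gaussian noise."""
    rng = random.Random(seed)
    return [amp * math.cos(2.0 * math.pi * freq * k + phase)
            + rng.gauss(0.0, sigma) for k in range(n)]
```

A call such as `search(toy_data(2000, 0.1, 0.5, 0.7, 1.0), freqs, 1.0)`, with `freqs` spaced by the Fourier resolution 1/2000, recovers the injected frequency without ever placing templates over amplitude or phase.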

Source confusion. Unfortunately, any search that treats binary signals individually 
must contend with the effects of source confusion: signals that are close in parameter 
space have nonzero correlations, so the template bank will not "ring" at the 
correct parameter locations, but rather at points displaced by systematic confusion 
errors. A local analysis of the multisource likelihood surface shows that parameter- 
estimation accuracy decays exponentially with the number of overlapping signals, and 
superexponentially with the degree of correlation [63]. A strong source may also hide 
a weaker one, or two sources of comparable strength may fuse into one. 
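
How quickly the correlation between two signals falls off with their separation can be seen in a small sketch. It assumes idealized complex sinusoids sampled at unit intervals, with no LISA modulation, and measures the overlap maximized over relative phase; the function name is illustrative.

```python
import math


def overlap(f1, f2, n):
    """Phase-maximized, normalized overlap of two sinusoids over n samples."""
    re = sum(math.cos(2.0 * math.pi * (f1 - f2) * k) for k in range(n))
    im = sum(math.sin(2.0 * math.pi * (f1 - f2) * k) for k in range(n))
    return math.hypot(re, im) / n
```

In this idealization two sources a quarter of a Fourier bin apart overlap by about 0.9 and are nearly indistinguishable, while at a separation of 1.5 bins the overlap has dropped to about 0.2.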

Iterative detection schemes have been developed to deal with this problem: 
gCLEAN identifies the brightest source in the data and subtracts a small fraction 
of it; this is repeated until a prescribed residual power is reached, and all signals 
are then reconstructed from the record of subtractions [64]. "Slice and Dice" again 
identifies the brightest source, but subtracts it completely; then the parameters of all 
the sources identified so far are refit simultaneously by a global least squares, and 
resubtracted with the corrected parameters to provide the data for the next step; the 
process ends when the residual becomes consistent with noise [65]. 
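
The gCLEAN loop described above (find the brightest source, subtract a fraction of it, repeat, then rebuild each signal from the record of subtractions) can be sketched as follows. This is a toy under loudly stated assumptions, not the code of [64]: the "sources" are noise-free sinusoids on a fixed grid of candidate frequencies, there is no LISA response, and the `gain`, `tol`, and `max_iter` values are arbitrary.

```python
import math


def make_basis(freqs, n):
    """Precompute cos/sin templates and their 2x2 normal matrices."""
    basis = []
    for f in freqs:
        c = [math.cos(2.0 * math.pi * f * k) for k in range(n)]
        s = [math.sin(2.0 * math.pi * f * k) for k in range(n)]
        m11 = sum(v * v for v in c)
        m12 = sum(u * v for u, v in zip(c, s))
        m22 = sum(v * v for v in s)
        basis.append((c, s, m11, m12, m22))
    return basis


def best_fit(resid, entry):
    """Least-squares (a_cos, a_sin) for one template, and the power it captures."""
    c, s, m11, m12, m22 = entry
    x1 = sum(r * v for r, v in zip(resid, c))
    x2 = sum(r * v for r, v in zip(resid, s))
    det = m11 * m22 - m12 * m12
    a1 = (m22 * x1 - m12 * x2) / det
    a2 = (m11 * x2 - m12 * x1) / det
    return a1, a2, x1 * a1 + x2 * a2


def clean_loop(data, freqs, gain=0.2, tol=1e-4, max_iter=1000):
    """Subtract a fraction `gain` of the brightest fit until little power is left.

    Returns the reconstructed amplitude at each candidate frequency.
    """
    basis = make_basis(freqs, len(data))
    resid = list(data)
    power0 = sum(r * r for r in resid)
    record = [[0.0, 0.0] for _ in freqs]   # running sum of all subtractions
    for _ in range(max_iter):
        if sum(r * r for r in resid) <= tol * power0:
            break
        fits = [best_fit(resid, e) for e in basis]
        j = max(range(len(freqs)), key=lambda i: fits[i][2])  # brightest
        a1, a2 = gain * fits[j][0], gain * fits[j][1]
        c, s = basis[j][0], basis[j][1]
        resid = [r - a1 * u - a2 * v for r, u, v in zip(resid, c, s)]
        record[j][0] += a1
        record[j][1] += a2
    return [math.hypot(a, b) for a, b in record]
```

Subtracting only a fraction at each step matters when neighboring templates are correlated: the first fit to the brightest source is biased by its neighbor, and the partial subtractions let later iterations correct that bias instead of locking it in.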

Global fits. A more radical approach is searching for all sources at the same time, by 
maximizing the posterior probability over an (N x M)-dimensional manifold, where N 
is the number of sources and M the number of parameters. Template-bank schemes 
to do so become quickly unmanageable, so once again the community turned to 
MCMCs. Umstaetter and colleagues [66] tackled an idealized problem that does not 
include the LISA response; their paper is a very good introduction to the statistical 
underpinnings of such a search. Cornish and Crowder were then able to develop 
an effective MCMC LISA search, at first for a few binaries [67], then for a realistic 
number of resolvable systems [68]. The crucial realization is that signals that are well 
separated in frequency have no significant overlap, so the global fit of ~ 10^4 binaries 
can be tackled in frequency-domain blocks that contain ~ 100 sources, taking special 
precautions not to miss those at the boundaries between blocks. Once again, the 
convergence of the chains can be accelerated with tricks such as annealing, multiple 
proposal distributions, and source-tailored special moves. The Cornish-Crowder code 
was spectacularly demonstrated in MLDC 2, where it identified 19,324 binaries out of 
~ 26 x 10^6; template-bank searches were able to identify roughly a factor of ten fewer 
[7]. 
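
One way to organize a block-wise global fit is sketched below. The text does not say how the boundary precautions of [68] are implemented, so the scheme here is purely an assumption for illustration: each block fits a window slightly wider than its core, and a source is reported only by the block whose core contains it. The function names and the `margin` parameter are hypothetical.

```python
def make_blocks(f_min, f_max, n_blocks, margin):
    """Split [f_min, f_max] into cores, each with a wider fitting window."""
    edges = [f_min + (f_max - f_min) * i / n_blocks for i in range(n_blocks + 1)]
    edges[-1] = f_max  # avoid a rounding gap at the upper end of the band
    blocks = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        fit = (max(f_min, lo - margin), min(f_max, hi + margin))
        blocks.append({"core": (lo, hi), "fit": fit})
    return blocks


def core_block(blocks, f):
    """Index of the single block that reports a source at frequency f."""
    for i, b in enumerate(blocks):
        lo, hi = b["core"]
        if lo <= f < hi:
            return i
    return None


def fit_blocks(blocks, f):
    """Indices of all blocks whose fitting window includes frequency f."""
    return [i for i, b in enumerate(blocks) if b["fit"][0] <= f <= b["fit"][1]]
```

A source just above a block boundary then enters the fits of both neighboring blocks, so neither fit is distorted by unmodeled power leaking across the edge, but it is counted only once.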

Heterodox alternatives. In closing, I would like to mention genetic searches ([69], 
demonstrated in MLDC 1 [70]), which are similar in spirit to MCMC, but espouse 
metaphors of mutation and selection rather than those of statistical physics; and 
tomographic reconstruction ([71], applied to MLDC 1 and 2 [7,72]), which exploits the 
fact that the instantaneous LISA response can be expressed as a Radon transform 
[73] of the signal from all Galactic binaries (roughly speaking, an integral over all sky 
locations at times retarded by propagation to the LISA position); the transform can 
be inverted, yielding a somewhat smeared map of the sky as a function of frequency. 

6. In conclusion 

The heritage from ground-based detectors and the demonstrations given in the MLDCs 
have already gone a long way in providing solid foundations for the LISA science 
objectives. LISA data analysis is a vibrantly active field with many intellectual 
challenges still open, and the MLDCs have proved a stimulating platform to develop 
and test new and established ideas alike. MLDC contributions (covered only partially 
in this article) provide an ideal springboard to begin exploring the field. 